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Abstract. We study approximation algorithms for the following geo¬ 
metric version of the maximum coverage problem: Let V he a set of n 
weighted points in the plane. We want to place m a x b rectangles such 
that the sum of the weights of the points in V covered by these rectan¬ 
gles is maximized. For any fixed £ > 0, we present efficient approximation 
schemes that can find a (1 — £)-approximation to the optimal solution. 
In particular, for m = 1, our algorithm runs in linear time 0(nlog(-)), 
improving over the previous result. For m > 1, we present an algorithm 
that runs in 0(= log(i) -h time. 

Keywords: Maximum Coverage, Geometric Set Cover, Polynomial-Time 
Approximation Scheme 


1 Introduction 

The maximum coverage problem is a classic problem in theoretical computer 
science and combinatorial optimization. In this problem, we are given a universe 
V of weighted elements, a family of subsets and a number k. The goal is to 
select at most k of these subsets such that the sum of the weights of the cov¬ 
ered elements in V is maximized. It is well-known that the most natural greedy 
algorithm achieves an approximation factor of 1 — 1/e, which is essentially op¬ 
timal (unless P=NP) [22127118] . However, for several geometric versions of the 
maximum coverage problem, better approximation ratios can be achieved (we 
will mention some of such results below). In this paper, we mainly consider the 
following geometric maximum coverage problem: 

Definition 1. /MaxCovR(7^, to)/ Let V be a set of n points in a 2-dimensional 
Euclidean plane K^. Each point p G V has a given weight Wp > 0. The goal of 

*Jian Li, Bowei Zhang and Ningye Zhang’s research was supported in part by 
the National Basic Research Program of China Grant 2015CB358700, 2011CBA00300, 
2011CBA00301, the National Natural Science Foundation of China Grant 61202009, 
61033001, 61361136003. Haitao Wang’s research was supported in part by NSF under 
Grant CCF-1317143. 





2 


Jian Li^, Haitao Wang^, Bowei Zhang^, and Ningye Zhang^ 


our geometric max-coverage problem (denoted as MaxCovR(7^, m)^ is to place m 
a X h rectangles such that the sum of the weights of the covered points by these 
rectangles is maximized. More precisely, let S be the union of m rectangles we 
placed. Our goal is to maximize 

Cover(7^, S) = E Wp. 

pevns 

We also study the same coverage problem with unit disks, instead of rect¬ 
angles. We denote the corresponding problem as MaxCovD(7^, to). One natural 
application of the geometric maximum coverage problem is the facility placement 
problem. In this problem, we would like to locate a certain number of facilities 
to serve the maximum number of clients. Each facility can serve a region (de¬ 
pending on whether the metric is Li or L 2 , the region is either a square or a 
disk). 

1.1 m — 1 

Previous Results : We first consider MaxCovR(7^, 1). Imai and Asano [3S], Nandy 
and Bhattacharya [26] gave two different exact algorithms for MaxCovR(7^, 1), 
both running in time O(nlogn). It is also known that solving MaxCovR(7^, 1) 
exactly in algebraic decision tree model requires l7(nlogn) time [5|. Tao et 
al. |55| proposed a randomized approximation scheme for MaxCovR(7^, 1). With 
probability I — 1/n, their algorithm returns a (1 — £)-approximate answer in 
0(nlog(i) -l-nloglogn) time. In the same paper, they also studied the problem 
in the external memory model. 

Our Results : For MaxCovR(7^, I) we show that there is an approximation scheme 
that produces a (I — e)-approximation and runs in 0(nlog(j)) time, improving 
the result by Tao et al. |28|. 

1.2 General m > 1 

Previous Results : Both MaxCovR(7^, to) and MaxCovo)?^, to) are NP-hard if to 
is part of the input [53]. The most related work is de Berg, Cabello and Har- 
Peled [T3]. They mainly focused on using unit disks (i.e., MaxCovD(7^, to)). They 
proposed a (1 — £)-approximation algorithm for MaxCovD(7^, to) with time com¬ 
plexity 0{n{m/e)^^'^'^). 

[^We note that their algorithm can be easily extended to MaxCovR with the 
same time complexity. 

We are not aware of any explicit result for MaxCovR(P, to) for general to > 1. 

It is known JT^ that the problem admits a PTAS via the standard shifting 
technique |2I|-[j 

^ They were mainly interested in the case where m is a constant. So the running 
time becomes 0(n(l/e)'^^'^^) (which is the bound claimed in their paper) and the 
exponential dependency on m does not look too bad for m = 0(1). Since we consider 
the more general case, we make the dependency on m explicit. 

^ Hochbaum and Maass mi obtained a PTAS for the problem of covering given 
points with a minimal number of rectangles. Their algorithm can be easily modified 
into a PTAS for MaxCovR)!^,m) with running time 
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Our Results : Our main result is an approximation scheme for MaxCovR(7^, m) 
which runs in time 


O 




where A = 0(min(-y/m, j)). Our algorithm can be easily extended to other 
shapes. In Appendix]^ we sketch an extension for approximating MaxCovD(7^, m). 
The running time of our algorithm is 


/ /] 


1 \ 4 \ 


:) +>»(; 

;) 


Following the convention of approximation algorithms, £ is a fixed constant. 
Hence, the second term is essentially 0{m) and the overall running time is es¬ 
sentially linear 0{n). Our algorithm follows the standard shifting technique [5T], 
which reduces the problem to a smaller problem restricted in a constant size cell. 
The same technique is also used in de Berg et al. m- They proceeded by first 
solving the problem exactly in each cell, and then use dynamic programming to 
find the optimal allocation for all cells. 

Our improvement comes from another two simple yet useful ideas. First, we 
apply the shifting technique in a different way and make the side length of grids 
much smaller (0(j), instead of 0{m) in de Berg et al.’s algorithm [I3])- Second, 
we solve the dynamic program approximately. In fact, we show that a simple 
greedy strategy (along with some additional observations) can be used for this 
purpose, which allows us to save another 0(m) term. 


1.3 Other Related Work 

There are many different variants for this problem. We mention some most re¬ 
lated problems here. 

Barequet et al. [J], Dickerson and Scharstein m studied the max-enclosing 
polygon problem which aims to find a position of a given polygon to cover 
maximum number of points. This is the same as MaxCovR(7^, 1) if a polygon 
is a rectangle. Imai et al. |23j gave an optimal algorithm for the max-enclosing 
rectangle problem with time complexity 0(n log n). 

MaxCovo)?^, m) was introduced by Drezner [16]. Chazelle and Lee [10] gave 
an 0(n^)-time exact algorithm for the problem MaxCovD('P, 1). A Monte-Carlo 
(1 — e)-approximation algorithm for MaxCovD(7^, 1) was shown in [T], where V is 
an unweighted point set. Aronov and Har-Peled [3] showed that for unweighted 
point sets an 0{ne~^ logn) time Monte-Carlo (1 — £)-approximation algorithm 
exists, and also provided some results for other shapes, de Berg et al. m pro¬ 
vided an 0{n£~^) time (1 — £)-approximation algorithm. 

^ In fact, their dynamic programming runs in time at least 17(m^). Since they 
focused on constant m, this term is negligible in their running time. But if m > ^/n, 
the term can not be ignored and may become the dominating term. 
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For TO > 1, MaxCovD(7 ^,to) has only a few results. For to = 2, Cabello et 
al. [8] gave an exact algorithm for this problem when the two disks are disjoint in 
log^ n) time, de Berg et al. [13] gave (1—e)-approximation algorithms that 
run in log^™”^ (1/^)) time for to > 3 and in log (1/s:)) 

time for m = 2,3. 

The dual of the maximum coverage problem is the classical set cover prob¬ 
lem. The geometric set cover problem has enjoyed extensive study in the past two 
decades. The literature is too vast to list exhaustively here. See e.g., [711111712512^ 
and the references therein. 

2 Preliminaries 

We first define some notations and mention some results that are needed in our 
algorithm. Denote by Gs{a,b) the square grid with mesh size 6 such that the 
vertical and horizontal lines are defined as follows 

Gs{a, b) = {{x,y) G \ y = b + k ■ S,k G 'E"^\j{{x,y) G \ X = a + k ■ S,k G 

Given Gs{a,b) and a point p = (x,y), we call the integer pair {\x/5\, \jj/5\) the 
index of p (the index of the cell in which p lies in). 

Perfect Hashing: Dietzfetbinger et al. m shows that if each basic algebraic 
operation (including {-I-,—, x,log 2 ,exp 2 }) can be done in constant time, we 
can get a perfect hash family so that each insertion and membership query takes 
0(1) expected time. In particular, using this hashing scheme, we can hash the 
indices of all points, so that we can obtain the list of all non-empty cells in 0{n) 
expected time. Moreover, for any non-empty cell, we can retrieve all points lies 
in it in time linear in the number of such points. 

Linear Time Weighted Median and Selection: It is well known that finding the 
weighted median for an array of numbers can be done in deterministic worst- 
case linear time. The setting is as follows: Given n distinct elements Xi,X 2 , ■■■, Xn 
with positive weights wi,W 2 , Let w = The weighted median is 

the element Xk satisfying < 0 :^ >xk — ^/2- Finding the 

kth smallest elements for any array can also be done in deterministic worst-case 
linear time. See e.g., [12]. 

An Exact Algorithm for MaxCovR(7^, 1): As we mentioned previously, Nandy and 
Bhattacharya w provided an O(nlogn) exact algorithm for the MaxCovR(7^, 1) 
problem. We are going to use this algorithm as a subroutine in our algorithm. 

3 A Linear Time Algorithm for MaxCovR(P, 1) 

Notations: Without loss of generality, we can assume that a = b = 1, i.e., all 
the rectangles are 1x1 squares, (by properly scaling the input). We also assume 
that all points are in general positions. In particular, all coordinates of all points 
are distinct. For a unit square r, we use w{r) to denote the sum of the weights of 
the points covered by r. We say a unit square r is located at {x, y) if the top-left 
corner of r is (x, y). 

Now we present our approximation algorithm for MaxCovR(7^, 1). 
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3.1 Grid Shifting 

Recall the definition of a grid Gs{a, b) (in Section]^. Consider the following four 
grids: G2(0, 0),G2(0,1),G2(1, 0),G2(1,1) with <5 = 2. We can easily see that for 
any unit square r, there exists one of the above grids that does not intersect 
r (i.e., r is inside some cell of the grid). This is also the case for the optimal 
solution. 

Now, we describe the overall framework, which is similar to that in |28| . Our 
algorithm differs in several details. MaxCovCell(c) is a subroutine that takes 
a 2 X 2 cell c as input and returns a unit square r that is a (l-e)-approximate 
solution if the problem is restricted to cell c. We present the details of Max- 
CovCell in the next subsection. 


Algorithm 1 MaxCovR(7^, 1) 

n^max ^ 0 

for each G G {G2(0, 0), G2(0,1), G 2 (l, 0), G 2 (l, 1)} do 

Use perfect hashing to find all the non-empty cells of G. 
for each non-empty cell c of G do 
r -s— MaxCovCell(c). 

If w{r) > lUmax, then Wmax w{r) and r^ax r. 

end for; 
end for; 
return rmax; 


As we argued above, there exists a grid G such that the optimal solution is 
inside some cell c* € G. Therefore, MaxCovCell(c*) should return a (1-e)- 
approximation for the original problem MaxCovR(7^, 1). 

3.2 MaxCovCell 

In this section, we present the details of the subroutine MaxCovCell. Now 
we are dealing with the problem restricted to a single 2x2 cell c. Denote the 
number of point in c by ric, and the sum of the weights of points in c by Wc- We 
distinguish two cases, depending on whether ric is larger or smaller than (j) . 

If ric < (e)^j we simply apply the O(nlogn) time exact algorithm. [26] 

The other case requires more work. In this case, we further partition cell c 
into many smaller cells. First, we need the following simple lemma. 

Lemma 1. Given n points in with positive weights Wi,W 2 , ■■■, Wn, ~ 

w. Assume that xi,X2, ■■■,Xn are their distinct x- coordinates. We are also given 
a value Wd such that max(z(;i, W 2 > ■■■,Wn) ^ Wd < w, Then, we can find at most 
2w/wd vertical lines such that the sum of the weights of points strictly between 
(we do not count the points on these lines) any two adjacent lines is at most Wd 
in time 0{nlog{w/wd)). 

Proof. See Algorithm]^ In this algorithm, we apply the weighted median algo¬ 
rithm recursively. Initially we have a global variable G = 0, which upon termi¬ 
nation is the set of cc-coordinates of the selected vertical lines. Each time we find 
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Algorithm 2 Partition({xi, a; 2 ,Xn}) 

Find the weighted median Xk (w.r.t. in-weight); 

C = CU {xk}-, 

Generate 5" = {xi | Wi < Xk}, L = {xi \ Wi > Xk}', 

If the sum of the weights of the points in S is lager than Wd, run Partition(S); 
If the sum of the weights of the points in L is lager than Wd, run Partition(L); 


the weighted median Xk and separate the point with the vertical line x = Xk, 
which we add into C. The sum of the weights of points in either side is at most 
half of the sum of the weights of all the points. Hence, the depth of the recursion 
is at most [log(t(;/t(;(i)]. Thus, the size of C is at most 2 d°g(’"/“d)l < 2w/wd, and 
the running time is 0{n\og{w/wd))- □ 

Now, we describe how to partition cell c into smaller cells. First we partition 
c with some vertical lines. Let Cy to denote a set of vertical lines. Initially, £ = 0. 
Let Wd = We find all the points whose weights are at least Wd- For each 
such point, we add to £„ the vertical line that passes through the point. Then, 
we apply Algorithmto all the points with weights less than Wd- Next, we add 
a set Ch of horizontal lines in exactly the same way. 

Lemma 2. The sum of the weights of points strictly between any two adjacent 
lines in £„ is at most Wd = ■ The number of vertical lines in Cy is at most 

Both statements hold for as well. 

Proof. The first statement is straightforward from the description of the algo¬ 
rithm. We only need to prove the upper bound of the number of the vertical lines. 
Assume the sum of the weights of those points considered in the first (resp. sec¬ 
ond) step is FFi(resp. IF 2 ), Wi + W 2 = Wc. The number of vertical lines in £„ 
is at most 



The first term is due to the fact that the weight of each point we found in the 
first step has weight at least ^ 3 ^, and the second term directly follows from 
Lemma [T] □ 

We add both vertical boundaries of cell c into £„ and both horizontal bound¬ 
aries of cell c into £h- Now £ — £y Li Ch forms a grid of size at most -I- 

2) X +2). Assume £ = {ix,y) G M.^ \ y = y^J G {1,..., u}} U {{x, y) G 

\ X = Xi,i G {!,... ,i;}}, with both {yi} and {xi\ are sorted. £ partitions c 
into small cells. The final step of our algorithm is simply enumerating all the 
unit squares located at {xi,yj),i G {!,..., it}, j G {!,..., r}, and return the one 
with the maximum coverage. However, computing the coverage exactly for all 
these unit squares is expensive. Instead, we only calculate the weight of these 
unit square approximately as follows. For each unit square r, we only count the 
weight of points that are in some small cell fully covered by r. Now, we show 
this can be done in O (riclog (j) -I- (j)^) time. 
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After sorting {yi\ and {x^}, we can use binary search to identify which small 
cell each point lies in. So we can calculate the sum of the weights of points at 
the interior, edges or corners of all small cells in 0(nclog (i)) times. 

Thus searching the unit square with the maximum (approximate) coverage 
can be done with a standard incremental algorithm in O time. 

For completeness, we provide the details as follows. In fact, the problem can 
be reduced to the following problem: We have a, k x k grid, each cell {i,j) has 
a given weight w{i,j). We are also given two non-decreasing integer function]^ 
X : [k] ^ [k],Y : [/c] —>■ [A:] such that X{i) > i,Y{i) > i for all i G [/c]. The goal 
is to find i*,j* such that 

Hhj)= H 

is maximized. First,we define an auxiliary function H(i,j) = 

This can be computed separately for each column. We maintain two pointers 
p and < 7 , such that q = Y{p) holds through the process. Initially, p = I and 
<7 = y(I). In each iteration, we increment p by 1 and g by y(p-|-1) — y(p). Each 
w{i,j) will be encountered at most twice. So we can calculate all H{i,j) values 
in O (j) time. Then in the same way. We can use H{i,j) values to calculate all 

I(i,j) values in exactly the same way. This takes O time as well. We return 
the maximum value as the result for MaxCovCell(c). 

Putting everything together, we conclude that if > (“) , the running time 
of MaxCovCell(c) is O (jic log (i) -b (■ We can conclude the main result 
of this section with the following theorem. 

Theorem 1. Algorithm^returns a (1-e)-approximate answer for MaxCovR(7^, 1) 
mO(nlog(j)) time. 

Proof. We only need to prove that MaxCovCell(c) returns a (l-e)-approximation 
for cell c. The case < (i) is trivial since we apply the exact algorithm. So 

we only need to prove the case of Uc > (j)^. 

Suppose the optimal unit square is r. Denote by Opt the weight of the optimal 
solution. The size of c is 2 x 2, so we can use 4 unit squares to cover the entire 
cell. Therefore, Opt > (^). Suppose r is located at a point p, which is in the 
strict interior of a small cell B separated by £.[^ Suppose the index of B is (z, j). 
We compare the weight of r with I{i,j) (which is the approximate weight of the 
unit square located at the top-left corner of B). See Figure By the rule of our 
partition, the weight difference is at most 4 times the maximum possible weight 
of points between two adjacent lines in C. So /(*, j) > Opt —4- > (1 —e)Opt. 

This proves the approximation guarantee of the algorithm. 

Now, we analyze the running time. The running time consists of two parts: 
cells with number of points more than (j) and cells with number of points less 

"[k]={l,2, ... ,k} 

®If p lies on the boundary of B, the same argument still works. 
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Fig. 1. Proof of Theorem]^ Difference between the optimal solution(the top-left one) 
and our solution (the lower-right one) 


than (^)^. Let ni > 712 nj > > rij+i > 71 ^+ 2 , nj+k be the 

sorted sequence of the number of points in all cells. Then, we have that 


Running time < O log(7li_|_j)) 

=0 ^log Ylrii+j -h ^ n,+j log(ni+j)^ 

<0 l^log ^(ni) + « + ^ ni+j log Q 
=0 Aog ^{ui) -brz^ = O ("77log 


i=l 


This completes the proof. 


□ 


4 Linear Time Algorithms for MaxCovR(P, m) 

4.1 Grid Shifting 

For general m, we need the shifting technique |21j . Consider grids with a different 
side length: G6/e(a, b). We shift the grid to | different positions: (0,0), (1,1),...., ( 

1.1 — 1). (For simplicity, we assume that ^ is an integer and no point in V has 
an integer coordinate, so points in V will never lie on the grid line. Let 

G = {G6/e(0,0),..., G6/£(6/e — 1,6/e — 1)} • 

Then we have the following lemma. 


01 105 
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Fig. 2. Proof of Lemma the shifting technique. 


Lemma 3. There exist G* G G and a (1— -approximate solution R such that 
none of the unit squares in R intersects G*. 

Proof. For any point p, we can always use four unit squares to cover the 2x2 

square centered at p. Therefore, there exists an optimal solution OPT such that 

each covered point is cover by at most 4 unit squares in OPT. For each grid 

Ge.{i,i) G <G, we build a modified answer R^ from OPT in the following way. 
£ 

For each square r that intersects with Ge (z, z), there are two different situations. 
If r only intersects with one vertical line or one horizontal line. We move the 
square to one side of the line with bigger weight. In this case we will lose at 
most half of the weight of r. Notice that this kind of squares can only intersect 
with two grids in G. Similarly, If r intersects with one vertical line and one 
horizontal line at the same time, we move it to one of the four quadrants derived 
by these two lines. In this case we will lose at most 3/4 of the weight of r. This 
kind of squares can only intersect with one grid in G. (see Figure Now we 
calculate the sum of the weights we lose from Rq, i? 2 ,..., Rs_i, which is at most 
max{l/2 X 2, 3/4x1} = ! times the sum of weights of squares in OPT. By 
the definition of OPT, it is at most Aw{OPT). So the sum of the weights of 
Ro,R 2 ,...,Re_i is at least (- — T)w{OPT). Therefore there exists some i such 
that i?i(which does not intersect G6{i,i)) is a (1 — approximate answer. □ 

We present a subroutine in section |4.4| which can approximately solve the 
problem for a grid, and apply it to each non-empty grid in G. Then, in order 
to compute our final output from those obtained solutions, we apply a dynamic 
programming algorithm or a greedy algorithm which are shown in the next two 
sections. 

4.2 Dynamic Programming 

Now consider a fixed grid G G G. Let Ci,..., c* be the cells of grid G and Opt be 
the optimal solution that does not intersect G. Obviously, (|)^ unit squares are 
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enough to cover an entire f x | cell. Thus the maximum number of unit squares 
we need to place in one single cell is rric = min{m, (f )^}. 

Let Opt(ci,fc) be the maximum weight we can cover with k unit squares 
in cell Ci- For each nonempty cell Ci and for each k G [wc], we find a (1 — |)- 
approximation F(ci, k) to Opt(ci, k). We will show how to achieve this later. Now 
assume that we can do it. 

Let Optp(m) be the optimal solution we can get from the values F(ci,fc). 
More precisely, 

Optp( m) = max 


F(ci, ki) \ fcz - ' 


( 1 ) 


We can see that Optp(m) must be a (1— |)-approximation to Opt. We can easily 
use dynamic programming to calculate the exact value of Optp(m). Denote by 
A{i, k) the maximum weight we can cover with k unit squares in cells Cp, C 2 ,..., c^. 
We have the following DP recursion: 


A{i, k) 


max7“('=’™=) {A{i 
F(ci,fc) 


j) + F(ci,j)} if i>l 
if i = 1 


The running time of the above simple dynamic programming is 0(vn? ■ me)- 
One may notice that each step of the DP is computing a (+,max) convolu¬ 
tion. However, existing algorithms (see e.g., |6l30j l only run slightly better than 
quadratic time. So the improvement would be quite marginal. But in the next 
section, we show that if we would like to settle for an approximation to Optp(m), 
the running time can be dramatically improved to linear. 


4.3 A Greedy Algorithm 

We first apply our MaxCovR(7^, 1) algorithm in Section to each cell Ci, to 
compute a (1 — ^)-approximation of Opt(ci,l). Let /(ci,l) be the return val¬ 
ues. pi This takes 0(n log j) time. Then, we use the selection algorithm to find 
out the m cells with the largest /(ci,l) values. Assume that those cells are 
Cl, ...,Cm,Cm+i, Sorted from largest to smallest by /(c^, 1). 

Lemma 4. Let Opt^ be the maximum weight we can cover using m unit squares 
mci,...,Cm. T/ien Opt'> (1 — ^)Opt 

Proof. Let k be the number of unit squares in Opt that are chosen from Cm-i-i, • • ■, Ct 
This means there must be at least k cells in {cp,... ,Cm} such that Opt does 
not place any unit square. Therefore we can always move all k unit squares 
placed in Cm+i, ■ ■ ■ ,Ct to these empty cells such that each empty cell contains 
only one unit square. Denote the weight of this modified solution by A. Ob¬ 
viously, Opt' > A. For any i,j such that 1 < i < m < j < t, we have 

® Both /(ci, 1) and F(ci, 1) are approximations of Opt(ci, 1), with slightly different 
approximation ratios. 
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Fig. 3. F(ci,fc) (left) and F(ci,A:) (right) 


Opt(ci, 1) > /(ci,l) > /(cj,l) > (1 — ^)Opt(cj,l). Combining with a sim¬ 
ple observation that Opt(ci, k) < fcOpt(ci, 1), we can see that A > (1 — ^)Opt. 
Therefore, Opt' > (1 — ^)Opt. □ 

Hence, from now on, we only need to consider the first m cells {ci, 

We distinguish two cases. If m < 324(i)'*, we just apply the dynamic program to 
Cl, The running time of the above dynamic programming is 

If m > 324(i)'^, we can use a greedy algorithm to find a answer of weight at 
least (I — ^)Optp(TO). 

Let b = (|)^. For each cell Ci, we find the upper convex hull of 2D points 
{(0,F(c,,0)),(l,F(ci,l)), ..., (b,F(ci,b))}. See Figure Suppose the convex 
hull points are {(t^^o, F(cj, F(cj, ti,i)), ... , , F(cj, ti,s,))}, where 

ti,o = 0,ti,si = b. For each cell, since the above points are already sorted from 
left to right, we can compute the convex hull in 0(b) time by Graham’s scan| 2 ()j. 
Therefore, computing the convex hulls for all these cells takes O(TOb) time. 

For each cell Ci, we maintain a value pi representing that we are going to 
place ti^p. squares in cell c^. Initially for all i G [to], pi = 0. In each stage, we 
find the cell such that current slope (the slope of the next convex hull edge) 

F(C2, F(Ci,t2^p^) 

is maximized. Then we add 1 to pi, or equivalently we assign — ti^p. more 

squares into cell c^. We repeat this step until we have already placed at least 
TO — b squares. We can always achieve this since we can place at most b squares 
in one single cell in each iteration. Let m! the number of squares we have placed 
(to = b < to' < to). For the remaining m — m' squares, we allocate them 
arbitrarily. We denote the algorithm by Greedy and let the value obtained be 
Greedy(TO'). Having the convex hulls, the running time of the greedy algorithm 
is 0{m). 

Now we analyze the performance of the greedy algorithm. 

Lemma 5. The above greedy algorithm computes an (1 — e'^/9)-approximation 
to Optp(TO). 
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Fig. 4. F(ci, k) may not be concave: F(ci, 1) = 3, F(ci, 2) = 4, F(ci, 3) = 6. 


Proof. Define an auxiliary function F(ci,fc) as follows: If fc = for some j, 
f(ci,k) = F{ci,k). Otherwise, suppose tij <k < tij+i, then 


F(q, k) = F{c., u,,) + X {k - u,,). 




Intuitively speaking, F(ci,fc)^ee Figure]^ is the function defined by the upper 
convex hull at integer points. NThus, for all i G [m], F(ci, k) is a concave function. 
Obviously, F(ci, k) > F(ci, k) for all i G [m] and all k G [b]. 

Let Optp(i) be the optimal solution we can get from the values F(ci,A:) by 
placing i squares. By the convexity of F(ci,A:), the following greedy algorithm 
is optimal: as long as we still have budget, we assign 1 more square to the cell 
which provides the largest increment of the objective value. In fact, this greedy 
algorithm runs in almost the same way as Greedy. The only difference is that 
Greedy only picks an entire edge of the convex hull, while the greedy algorithm 
here may stop in the middle of an edge (only happen for the last edge). Since 
the marginal increment never increases, we can see that Opt^(i) is concave. 

By the way of choosing cells in our greedy algorithm, we make the following 
simple but important observation: 


Greedy(m') = Opt^(TO') = Optp(m'). 

So, our greedy algorithm is in fact optimal for m'. Combining with m — m' < h 
and the concavity of Optp, we can see that 

Optp(m') > ^ Opt^(m) > OptF(m). 


The last inequality holds because Optp (*) > OptF(i) for any i. 


□ 


^ At first sight, it may appear that F(ci, k) should be a concave function. However, 
this is not true. See Figure for an example. 
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4.4 Computing F(c, fc) 


Now we show the subroutine MaxCovCellM for computing F(c, fc). We use a 
similar partition algorithm as Section [3.2| The only difference is that this time we 
need to partition the cell finer so that the maximum possible weight of points 
between any two adjacent parallel partition lines is After partitioning 

the cell, we enumerate all the possible ways of placing fc unit squares at the grid 
point. Similarly, for each unit square r, we only count the weight of points that 
are in some cell fully covered by r. The algorithm is summarized in Algorithmj^ 
We can adapt the algorithm in m to enumerate these possible choices in 
0((i)'^) time where A = i)). We briefly sketch the algorithm in 

Appendix Now we prove the correctness of this algorithm. 

Lemma 6. MaxCovCellM returns a (1— |) approximate answer for Opt{ci, k). 

Proof. We can use (|)^ unit squares to cover the entire cell, so Opt(ci,fc) > 
By the same argument as in Theorem the difference between Opt(ci, fc) 
and the answer from Algorithm are at most 4fc times the maximum possible 
weight of points between two adjacent parallel partition lines. Therefore, the 
algorithm returns a (1 — |)-approximate answer of Opt(ci, fc). □ 


Now we can conclude the following theorem. 


Theorem 2. Algorithm^returns a (l—e)-approximate answer/or MaxCovR(7^, m) 
in time 


O 




where A = 0(min(i/m, i)). 


Proof. We know that Optp(m) of m selected cells is a (1 — |)-approximation 
to Opt', so the greedy algorithm returns a (1 — |)(1 — ^)-approximation to 
Opt'. Combining with Lemma and Lemma Algorithm returns a (1 — 
^)(1 — |)(1 — ^)(1 — %) approximate solution of the original problem. Since 
(1 — ^)(1 — |)(1 — %)(1 “%)>(!“ Algorithm does return a (1 — e)- 
approximate solution. 

We only need to calculate the overall running time. Solving the values F(ci, 1) 
and finding out the top m results require O(nlogj) time. We compute the 
values F(ci, fc) of m cells. For each cell Ci, we partition it only once and calculate 
F(ci, 1),..., F(ci, Z\) using the same partition. Computing the values F(ci, fc) of 
all m cells requires 0(nlog(-) + time. We do the same for - different 

grids. Therefore, the over all running time is as we state in the theorem. □ 
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A Enumeration In MaxCovCellM 

We can adapt the algorithm in [13] to enumerate these possible ways of placing 
k unit squares at the grid point in time where A — 0{inm{y/m, i)). We 

briefly sketch the algorithm. We denote the optimal solution as Opt^-. From [5] 
we know that for any optimal solution, there exists a line of integer grid (either 
horizontal or vertical) that intersects with 0(i/m) squares in Opt^-, denoted 
as the parting line. So we can use dynamic programming. At each stage, we 
enumerate the parting line, and the O(y^) squares intersecting the parting 
line. We also enumerate the number of squares in each side of the parting line in 
the optimal solution. The total number of choices is 0(( j)^). Then, we can solve 
recursively for each side. In the recursion, we should consider that a subproblem 
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which is composed of a smaller rectangle, and an enumeration of 0{^/rn) squares 
of the optimal solution intersecting the boundary of the rectangle and at most 
m squares fully contained in the rectangle. Overall, the dynamic programming 
can be carried out in 0((i)'^) time. 


B Extension to Other Shapes 


Our algorithm can be easily extended to other shapes, such as unit disks. For 
ease of description, we only consider unit disks, i.e., the MaxCovD(7^, m) problem. 
The algorithm framework is almost the same as before, except several implemen¬ 
tation details required changing from rectangles to disks. The major difference 
is the way for building an £-approximation in each cell (the partition scheme in 
Section 4.4 works only for rectangles). 

Now, we sketch our algorithm, focusing only on the differences. Again, we hrst 
adopt the shifting technique . For unit disk, we consider grids with a different side 
length: G^Q/g(a,b). Again for simplicity, we assume that ^ is an integer and no 
point in V has an integer coordinate. We shift the grid to — different positions: 

(0,0), (2,2),...., (f-2, f-2). Let G = {G56/.(0, 0),..., G^eU^e/s - 2,56/e - 2)} . 
We can prove the following lemma, which is similar to Lemma The only dif¬ 
ference in the proof is that for unit disks, there exists an optimal answer Opt 
such that each covered point is covered by at most 7 disks in Opt instead of 4 
for unit squares. 


Lemma 7. There exist G* S G and a (f — ^)-approximate answer R such that 
all the disks in R do not intersect any line in G*. 

Now consider a hxed grid G G G. For each cell c, we compute a (1 — ^)- 
approximation to Opt(c, 1), in 0(ne“®) time, by applying the following result in 

m- 


Lemma 8 . [T3] Give a set V ofn weighted points and e(0 < e < 1), we can find 
a (1 — e)-approximation for MaxCovD(7^, 1) in 0(ne~^) time. 

Then, we use the selection algorithm to find out the m cells with the largest 
returned values. The rest of our algorithm will only consider those m cells. For 
each such cell c, we want to compute (1 — |)-approximations F(c, k) of Opt(c, k). 
For this purpose, we can use the following lemma from Ha- 

Lemma 9. [13j Give a set V of n weighted points, an integer to > 1 and r >1. 
Let U he the collection of sets that are the union of m unit disks. In range space 
{V, {T^ns I s G t/}), Let A be a (l/2r)-approximation for the range space, where 
r = w[P)/{eMzxQo\to{V,m)). //Opt^ is a optimal solution for MaxCovD(A, to), 
then Opt^ is a {1 — e)-approximation for MaxCovD(7^, to). 

By the above lemma, we only need to build a (l/4r)-approximation A for the 
cell c, where r = Wc/(eOpt(c, fc)) and apply an exact algorithm to A. Since we 
can use 0(l/e^) unit disks to cover the whole cell, we can see that r < 0{l/ke^). 
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We can build the l/4r-approximation A of size 0(^6 log |) in 0(nc • 
time (see e.g., m)- Using the exact algorithm in m for MaxCovD(7^, m), we 
can compute the exact solution for A in time. 

Suppose we have computed all f{c,k) values for the first m cells. If m < 
0(l/e^), we apply the dynamic programming, as in Section |4.2[ Otherwise, we 


apply the algorithm Greedy in Section 4.3 The correctness proof is exactly 
the same, except that the approximation ratio in Lemma changes slightly to 

Now we can summarize the overall running time. Computing the value F(c, 1) 
for each cell requires 0(ns~^) time. We build e-approximations in m different 
cells and k is at most min{0(l/e^), m}. Therefore the overall running time is 
0 („(l) 0 (i) where = 0(min(v^, A). 



